INSTITUT NATIONAL DE RECHERCHE EN INFORMATIQUE ET EN AUTOMATIQUE 



c 

(N 
oc 



ESS for life-history traits of cooperating consumers 

facing cheating mutants 

p. Bernhard — F. Grognard — L. Mailleret — A.R. Akhmetzhanov 



u 
o 

TO 



(N 
> 

(N 
> 

X 

TO 



N° 7314 

June 2010 
.Theme BIO 




WflNR^IA 

^^^^ SOPHIA ANTIPOLIS 



ESS for life- 



history traits of cooperating consumers 
facing cheating mutants 



P. Bernharc^!! , 



F. GrognarcH , L. MailleretH , A.R. Akhmetzhano\i] 



Theme BIO — Systemes biologiques 
Pro jet Comore 



Rapport de recherche n° 7314 — June 2010 — [18] pages 



Abstract: We consider a population of identical individuals preying on an exhaustible 
resource. The individuals in the population choose a strategy that defines how they use their 
available time over the course of their life for feeding, for reproducing (say laying eggs), or 
split their energy in between these two activities. We here suppose that their life lasts a full 
season, so that the chosen strategy results in the production of a certain number of eggs laid 
over the season. This number then helps define the long-run evolution of the population 
since the eggs constitute the basis for the population for the next season. However, in this 
paper, we strictly concentrate on what occurs within a season, by considering two possible 
strategies: the collective optimum and the uninvadable strategy. 

The (collective) optimal strategy involves a singular arc, which has some resource saving 
feature and may, on the long-term, lead to an equilibrium. However, it is susceptible to 
invasion by a greedy mutant which free-rides on this resource saving strategy. Therefore, 
the "optimal" strategy is not evolutionarily stable. 

We thus look for an evolutionarily stable strategy, using the first ESS condition ("Nash- 
Wardrop condition"). This yields a different singular arc and a strategy that conserves less 
the resource. Unfortunately, we show in a forthcoming paper that, though it is an ESS, it 
is not long-term (interseasonal) sustainable. This is an instance of the classical "tragedy of 
the commons". In this paper we also investigate whether the mixed strategy on the singular 
arcs may be interpreted in terms of population shares using pure strategies. The answer is 
negative. 

Key-words: Differential Games, Population Dynamics, Resource- Consumer Model, Evo- 
lutionarily Stable Strategy 
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Stabilite evolutionnaire des strategies d'histoires de vies 
de consommateurs solidaires confrontes a une invasion de 

mutants 

Resume : Nous considerons une population d'individus identiques s'attaquant a une 
ressource epuisable. Les individus de la population doivent choisir une strategie qui definit 
comment ils utilisent leur temps disponible au cours de leur vie a se nourrir, pour la re- 
production (par exemple la ponte), ou en partageant leur energie entre ces deux activites. 
Nous supposons ici que leur vie ne dure qu'une saison complete, de sorte que la strategie 
choisie se traduit par la production d'un certain nombre d'oeufs pondus au cours de la sai- 
son. Ce nombre permet alors de definir revolution a long terme de la population car les 
oeufs constituent la base de la population pour la saison suivante. Dans ce rapport, nous 
nous concentrons strictement sur ce qui se produit au sein d'une saison en considerant deux 
strategies possibles: I'optimum collectif et la strategie non invasible. 

La strategie (collective) optimale definit un arc singulier le long duquel la ressource est 
economisee; sur le long terme, cette strategie pent conduire a un equilibre. Par centre, cettre 
strategie rend la population sensible a I'invasion par un mutant glouton qui n'economiserait 
pas la ressource sur Tare singulier. Par consequent, la strategie "optimale" n'est pas evolu- 
tionnairement stable. 

Nous allons done rechercher une strategie evolutionnairement stable, en utilisant la pre- 
miere condition d'ESS ("condition de Nash- Wardrop") . II en resulte un autre arc singulier 
et une strategie qui preserve moins la ressource. 

Dans cet article, nous verifions aussi si une strategie mixte sur un arc singulier pent 
etre interpretee comme une representation d'une population dont une partie se reproduit 
continuellement et I'autre se nourrit. La reponse est negative: la strategie mixte ne pent 
representer des parts de populations jouant des strategies pures. 

Mots-cles : Jeux differentiels, Dynamique des populations, Modele consommateur- 
ressource, ESS 
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1 Introduction 

1.1 Context and contribution 

1.2 Context and contribution 

The evolution of life history traits, e.g. the reproduction strategy an organism should follow 
during the course of its life, is an important topic of evolutionary ecology [51 [16]. In this 
respect, seminal contributions based on optimal control theory principles have focused on 
the optimal energy allocation schedules of organisms into growth or reproduction (see the 
reviews [TSl [H [7|). These studies consider an "energy allocation" trait, the proportion of 
energy an organism invests at a given A moment of its life into somatic growth or into re- 
production forms, and look for dynamic (age and state dependent) strategies that maximize 
the reproductive output of the organism. They show in particular how different growth- 
reproduction dynamical strategies, like the determinate and indeterminate growth patterns, 
may emerge from different hypotheses on the biology of the organisms ([HI [3]). These stud- 
ies are however based on models at the individual level, and thus do not address population 
issues and the potential feedbacks of population densities on the life history strategies that 
should be promoted by evolution. 

The eventual success of a strategy does however not necessarily depend on its optimal 
exploitation of their ressource: a species indeed participates in games with other species and 
mutants of its own which could take advantage of its "optimal" strategy; evolutionary game 
theory is then the general framework for the study of which strategy is selected through 
evolution. In that framework, the Adaptive Dynamics approach A is a recent methodology 
specifically tailored to tackle evolutionary questions arising in a population context [SJ [3] . 
Based on the (un)-invasibility analysis of rare mutants into a resident population at eco- 
logical equilibrium, the studies adopting this framework have mostly concentrated on scalar 
or multi-dimensional traits. As such, they fairly overlooked potential dynamical life his- 
tory traits which are by essence of an infinite-dimensional nature (see however [?] for an 
exception) . 

In this contribution, we shall build on the model proposed by [1] to describe the A in- 
teractions between populations of seasonal species of consumers and resource organisms. 
This model is of a semi-dicrete type [2]: the intra-seasonal interactions between the mature 
consumers and resource populations as well as production of immature forms are modelled 
with a continuous time system, while the inter-seasonal demographic processes, essentially 
maturation of the immature forms and death of the mature populations, are modelled with 
a discrete mapping. As such, this model allows to account for dynamical strategies {e.g. 
feeding or reproduction of the consumers) at the intra-seasonal level, as well as their conse- 
quences on populations at the long term scale through the investigation of the inter-seasonal 
dynamics. More precisely, [1] studied the optimal feeding-reproduction strategy of the con- 
sumer population; it can be looked at as an extension of the works on optimal energy 
allocation described above to an explicit populational context. The current consensus is 
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however that evolutionary studies should rely on invasibility criteria (as the Adaptive Dy- 
namics methodology does) rather than on optimization criteria |13l I12| . 

The objective of the present contribution is to extend the work of [T] to investigate "un- 
invadable" dynamic strategies, i.e. strategies that can not be beaten by alternative strategies 
followed by some mutants taking advantage of their small density. In particular, we show 
that the optimal cooperating strategy of the consumers computed in [T] can be invaded 
(see also [5] for a deeper study of this issue), and that consumers may adopt an alternative 
Evolutionarily Stable Strategy (ESS) which prevents any mutant invasion. Though it will 
not be developed in the present paper which concentrates on within season strategies, we 
can show that, contrarily to the "cooperative" or "optimal" strategy that can lead to a long 
term co-existence of resource and consumer populations, the ESS strategy always leads to 
an evolutionary suicide of the consumers in the interseasonal dynamics. 

1.3 The model 

We consider an homogeneous population of essentially fixed size. The energy level of a 
representative individual is p £ R+, the proportion of its time spent feeding is m G [0, 1], the 
amount of available resource (say the preys population size) is n G W^. The dynamics are 
as follows, with a, b and c positive coefHcients: 

p ~ —ap + bnu , (1) 
n ~ —cnu . (2) 

It should be noticed that these dynamics leave the strictly positive othant invariant. We 
may therefore exploit the following feature of these dynamics: The ratio x = p/n obeys the 
scalar dynamics 

X = —ax + {b + cx)u . (3) 

The criterion for long term sustainability is the number of eggs layed during a season, of 
length T, given as 

J = [ {l-u{t))p{t)dt. (4) 
Jo 

This criterion is not expressible in terms of x only. 

A small population of potential invaders will be considered, with the same energy dy- 
namics 

Pm = -apni + bnum (5) 
and criterion ^ 

Jm= I (1 - Um)prn At , (6) 

Jo 

but with no effect on the resource amount because of the negligible size of that population. 
It gives rise to the variable Xm = Pm/n with dynamics 

Xjri — aXfYi ^ bu^i -\~ CXjYiU . ^T) 
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2 Collective optimum 

2.1 Euler Pontryagin equations 

We use a standard maximum principle approach to finding the optimal collective behavior. 
To take advantage of the reduced size dynamics, let V be the Bellman function of this 
problem. We shall show that it has a solution of the form 

V{t, n, p) = nil {t, x) . 

If this is so, 

dV _ ^dU dV _^dU dV _ ^ ^dU 
dt dt ^ dp dx ^ dn ' dx 

The Hamilton Jacobi Caratheodory Bellman equation thus reads 

= 0. 



dU 

n— — h max 

dt «e[o,i] 



dU, , , dU , ,^ , 

-——[ap — bnu) — (7 — x—— cnu + (1 — ujp 
dx \ dx ' 



Again, we may divide through by n to obtain 
dU 



— — h max 

dt tie[o,i] 



dU 

-g—[ax ~ [b + cx)u] — Ucu + (1 — u)x 



0, (8) 



with the terminal condition 

Va; e ]R+ , U{T, x) = . 

Introduce the notations 

dU dU 
v[t) = —{t,x{t)), \{t)^—{t,x{t)). 

The equations of the characteristics of this PDE, or the Euler-Pontryagin adjoint equations, 
yield 

V = cuv, v{T) = , , , 

A = aA - 1 + u , A(r) = , ^ ' 

with u £ argmax„{[A(6 + cx) — x — cU]u}. We further simplify this with the additional 
notation ^ = U — Xx which is just dV/dn. Using © and ([5]), we get 

fi ^ {c^i - b\)u , ^(T) = 0, (10) 

and that the optimal u is given by 

u{t) e argmax„{cr(t)-u} , (11) 
a = bX — cfi — x. (12) 
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2.2 Field of trajectories 
2.2.1 The switch line 

At i = T, a{T) = —x{T) < 0. Therefore, the final optimal control is u = 0. The final part 
of the optimal state and adjoint trajectories are thus 

x{t) = x(T)e'^(^-*) , X{t) = ^ (l - e--^^^-*)) , ^i{t) = , 

hence a{t) = bX{t) — x{t) vanishes on a switch line 

x = bX={b/a)[l~e"''-^-*^]. 

On this curve, the optimal control switches from 1 to 0. 

A terminal trajectory with u ~ can emanate from this switch line only if its slope is 
larger than that of the switch line itself, i.e. if 

^ax > -6e-°(^-*) 

which, placed back in the equation of the switch line yields 

x<x = b/{2a) and t > i = T - {ln2)/a . (13) 

Therefore, the switch line extends only from this (i, x) to x{T) = 0. The last primary 
trajectory, passing through x{i) = x has x{T) = b/ (4a). The trajectories with x{T) > b/ (4a) 
have M = as their optimal control for all t < T. 



2.2.2 Tributaries of the switch Hne 

Trajectories with x{T) < b/{Aa) have a corner on the switch line. Prior to reaching it, they 
have M = 1, hence 

i = (c — a)x + b . 

These trajectories, reffered to as the tributaries of the switch line, are as follows. Let tg, 
Xs = {b/a)[l — e""^"^"*"-*] be their end point on the switch line. We get 

^ (j:^ + -±-) e-(--)(*-*) _ -±- . 
\ c — a J c — a 

We show in the appendix that indeed, cr does not change sign on these trajectories. They 
have x(-) increasing with time. As a matter of fact, we have i > if c > a or, if c < a and 
X < b/{a — c). But since on the switch line, x < b/{2a) < b/{a — c), this is always satisfied. 
Hence all such trajectories come from the time axis. However, it is easy to check that, if 

T>iln2+^ln£±^, 
a c — a 2a 

the trajectory ending at x(t) = x comes from x(t) = for a positive i, and all other, which 
have a smaller x{t), a fortiori. Hence in that case none of the trajectories described so far 
accounts for an initial condition a::(0) < (6/4a)e°'^. 
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2.2.3 Singular line and tributaries 

The space between the primary trajectory x{t) = b/ (4a) exp[a(r — t)] and the tributary 

^(t) ^b(^ + e-(--)(*-*) - 

\2a c — a J c — a 

is not accounted for so far. In that space, we construct a singular line ending at {t,x). We 
differentiate twice the expression for <t ([^^ . using ([5)1 . (|!?|) . PH]) : 

a = ca + abX + ax — b , 

a = ca + a[{2b + cx)u + abX — 6 — ax] . 

Using (7 = and hence a = abX + ax — b = 0^ we get 

—a = {2b + cx)u — 2ax . 
a 



Hence, a singular control is 



With this control. 



2^^^ - . (14) 



2b 



>0. 



b + 2cx 



Hence in particular x{t) < x = b/{2a). And towards the negative times, the asymptote is 
X = 0. More precisely, the singular arc satisfies 

, i 26/1 1\ , 
In - + — - - - = a(i - i) , 

X C \X X J 

which can be solved for x in terms of t with the help of the Lambert function W as 

-1 



2b 
X = — 

c 



Wo (^^e"(^-*)+4a/c^ 



a perfectly useless formula . . . 

We check that yields u < 1. This is true if (2a — c)x < 2b. This is always true if 
2a < c, and, if 2a > c provided that x < b/ (2a — c). But we have seen that on the singular 
arc, X < b/{2a) < b/{2a — c). Hence the conditionn is always satisfied. 

Substituting into © and (|10p . and using ([T^ . one obtains 

A = --x, A = , 

a b 

2 . b 2x 

I^L = X , /i = . 

c ac c 
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Now, tributaries can reach this singular arc at any of its points with either u ~ 1, coming 
from "under" the singular arc, or with u = coming from "above"'. Let (t^,x^) be a point 
where a tributary reaches this singular line. The tributary with u = satisfy 

X=-- ^e-'-f'^-*) , 
a b 

^ ^ - ( - - 2x' 
c \a 

and hence 

cr = 2x"(l-ch[a(i"-0]) <0. 

Therefore, the optimal control remains it = all along. The analysis of the tributaries with 
u = 1 is done in the appendix together with those of the switch line. 



3 Uninvadable strategy 

3.1 "Optimal" strategy can be invaded 

We consider a very small sub-population of mutating individuals behaving differently. Let a 
subscript m stand for variables concerning that population, with .t,„ = Pm/n. It is so small 
that it has no effect on resource depletion. Its dynamics are given by (O and ([7]) , while the 
dynamics ([T]) and ^ are unchanged, and therefore also Its fitness is measured by the 
criterion ([6]). 

Let us consider u ~ (j){t, x) given, and control seeking to maximize J„i- The same 

construction as above leads to the Value function VmiPm,n)Um{t, Xm)-, with the Hamilton 
Jacobi equation 

dU^ - ^^IlIt{aXm - bu„, + CXmU) - UmCU + (1 - U,n)x„i = . (15) 

ot ox„i 

It should be pointed out that this equation holds true with Vm = nUm{t,Xm) the fitness 
obtained by the mutant, even if it does not "optimize". Using the notations 

_ dUm . _ dU,n _ 

ot OXm 

its characteristics are 

l>m=CUV„i, V„i{T)=0, 

Xm^aXm-l + Um, A„(r)=0, (16) 
Differentiating with respect to Um, we find that 

(t)v„At) dt. 



■Vm = / cr,„(t)Wm(i)< 

Jo 
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with 

O'm = bXjn — Xm ■ (17) 

If the mutant behaves as the resident population, the variables indexed m are equal to the 
corresponding variables without the index. On the singular arc, this leads to am = cr + cjjL. 
We also have, on this arc, using cr = 6A — c/i — a; = 0, 

a = [cu — bXju — — — < . 

^ ^ > 2b + cx 

Since = 0, it follows that for t < t, > 0, hence (T,„ > 0, which means that the 
mutant can improve its fitness by choosing Um = 1- Hence a mutant may appear that will 
be more fit than the resident population: an invasion. 

This is not surprising: the singular arc corresponds to a time interval during which the 
resident population cooperates to spare some resource. The mutant, who does not deplete 
that resource, has no incentive to spare it and can use as much as it wants. We defer to 
a future paper the analysis of the optimum behaviour of the mutant in this cooperating 
"optimum" population. 

3.2 An uninvadable strategy 

Since the "optimal" strategy can be invaded by a mutant, there is a priori no reason for this 
strategy to survive on the long-run. Therefore, we seek a strategy for the resident population 
that could not be invaded. For that purpose, we seek a control law u = (t>{x, t) such that 
the optimum mutant strategy will be 

Ura=U. (18) 

Therefore, there will be no possibility for a mutant to be more fit than the resident. This 
will lead to a new type of singular arc. 

Close to final time, we have am = 0, therefore the optimal behaviour of the mutant is 
Um = 0. We choose m = to insure the equality (fT8|) . We obtain the same field as previously, 
from the switch line on, up to time T . 

We attempt to construct a singular line for the mutant, attaching to it, x), and satisfaying 
(|18p . This means that the dynamics of both the state and the adjoints are as in the previous 
analysis, because of (|18p . but cr„, is as in p7|) . Differentiating along the singular arc, we get 

am — baX — b + ax — cxu 

hence, am — translates into 

ax + abX~b 2ax — b „, , 
u = = ~: u (X) . . (19) 

cx cx 

The second form above is obtained using am = 0. Interestingly, the first differentiation of 
am yields a definite expression for u, contrary to what is observed in classical singular arcs. 
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The reason is that this u is not the optimizing one, say Um, but the control of the resident 
population. However, we wish that both be equal. This also leads to Xm — x. Yet, the 
formula (fTH)) should not be considered a feedback 

2aXm — b 
u = 

which would cause Xm, and therefore u^, to influence the mutant's dynamics, resulting in 
a different set of adjoint equations. The resident's control and state histories should be 
considered as fixed a priori, independant of the mutant's action. They are just chosen in 
such a way that the optimal mutant's variables coincide with them. 
Substituting u'^ (fTO|) in the dynamics, this yields 

X = — [acx^ + bi2a - c)x - b^] . 
cx 

It is easy to see that under these dynamics, as t — > — oo, x{t) — >■ x with 

b 



2ac 



-2a + c+ \J ^a} + 6 



Notice also that the formula for u in pT?|) is increasing in a;, as is easily seen. Moreover, 
on the singular arc, x €! [x, i], thus on the one hand, u > u^ix) = 0, and on the other hand 
after a straightforward calculation: 



(^) = 2 2c <2- 

Hence this is a feasible control. 

The tributaries of this singular arc are investigated together with those of the cooperative 
case in the appendix. 

It is worth mentioning that, at the final point (i, x) of this new singular arc, we have for 
the singular control u = 0. Hence the singular arc has the same slope as the switch line, 
(and as the primary tangent to it). The singular arc and the switch line form together a 
"smooth" curve. Such a feature is known to be the rule in zero-sum two-person differential 
games. The next section explains why. 



4 Further remarks 

4.1 A zero-sum game formulation of Wardrop equilibria 

Let a "resident" population "chooses" its behaviour, or strategy, u in some strategy space 
14, while a potential mutant — or cheater, or free-rider — would use a strategy Um G U. Let 
J{u,Um) and Jm(u,Um) be the fitness of the resident and mutant population respectively 
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Figure 1: The field of trajectories of the ESS. Green solid line: the singular arc. Dotted 
green line: the singular arc of the cooperative solution. 

under this scenario. By hypothesis, if the "mutant" behaves as the resident, there is no 
difference between them in terms of fitness, hence 

J{u,u) = J„i{u,u) . (20) 

A strategy u* G W is said to be uninvadable, or be a Wardrop equilibrium \17\ , if a A mutant 
cannot do better than a resident population using that strategy, i.e. if 

Let 

AJ{u,v) = J„i{u,v) - J{u,v) . (21) 
Equivalently, the uninvadability property can be stated as 

sup AJ{u* ,u„i) <0. 

However, it follows from (|^n|) that AJ{u* ,u*) = 0, so that the supremum above is always 
non-negative. Hence, a strategy u* is uninvadable, or a Wardrop equilibrium, if and only if 

max AJ(u*,u,„) = AJ(u*,u*) = 0. 

And since this supremum is always non-negative, it follows that there exists a Wardrop 
equilibrium if and only if 

min sup AJ(u,u„j) = 0. 
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Therefore, the search for a Wardrop equihbrium can be performed as follows : solve the 
zero-sum two-person game min^ max„^ A J(u, Um), and if the min exists, check whether the 
min-sup reached yields a Value zero. If yes, the minimizing w is a Wardrop equilibrium. If 
furthermore the maximum in m„j is unique, then this is also an evolutionary stable strategy, 
or ESS [TU] . This is the case in the above problem. 

4.2 Homogeneous problems 

The following piece of theory can undoubtly be deduced from Noether's theorem [T31. We 
offer a simple direct derivation. It accounts for the transformation we used via the intro- 
duction of the variable x. 

4.2.1 The problem 

Let a control problem be given by a state y £ M", a control m g U, dynamics 



with all regularity and growth assumptions to guarantee existence of the solution for all 
measurable control functions u{-) E A^(IR+ ^ U). 

A terminal condition Tiy) = 0, T{-) G C^{R"-), meaning that ii = in{{t \ T{y{t)) = 0}, 
and for simplicity we assume that all feasible trajectories are transverse to that differentiable 
manifold. A criterion to be minimized by the control action is given, of the form 



Assumptions 

1. The functions /, T, K and L are homogeneous of degree one in y. 

2. The state variable y„ remains strictly positive for all control functions. 

It follows from these assumptions that if (j) stands for /, T, K or L. and if stands for 

the first n — \ components of y, there exists a function of 77. — 1 variables noted such that 



y .fiy,^), 2/(0) =2/0, 





Let also x = (l/?/™)?/^""^^. Then, we claim 



Theorem 



1. The reduced state x(-) obeys the dynamics 



x^g{x,u)^P '^{x,u) - xfn{x,u) , 



INRIA 



ESS for life-history traits of cooperating consumers facing cheating mutants 



13 



2. The Value function V{y) of this problem is homogeneous : 

V{y) = ynU (-y^"-'A 
\yn J 

3. The function U solves the Hamilton Jacobi equation of the problem 

J{xo) = yn{T)K{y{T)) + / ' y„{t)L{x{t), u{t)) dt . 

Jo 

where 



yn(i) := yn(0)exp / fn{x{s) , u{s)) ds 



4.2.2 Proof of the theorem 
Dynamics Direct calculations result in 

it = —fi{y,u) - ^fn{y,u) = gi{x,u) . 

yn yi 

Value function It is completely elementary to see that if the data are globally homo- 
geneous of degree one in x, so is the Value function. Therefore, there exists U such that 
=x„C/((l/x„)x"-i. 

Hamilton Jacobi equation Easy calculations yield 



and 



dV dU 



dxi 



Substitute these in the standard Hamilton Jacobi equation 



dV 

-g;^f{y,u) + L{y,u) 



0, 



to get (with X (l/y„)2/("-^') 
'dU 



max 



dx 



dU . 



ix)f''-'{y,u) + f„{y,u) Uix)~—ix)x] + L{y,u) 



dx 



0. 



If yn 7^ 0, we may write this as 
max y„ 



dU 



fn{x,u)U{x) + —{x)g{x,u) + L{x,u) 
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If furthermore yn{t) > 0, then we may divide through by ?/„ without changing the max 
operator. We recognize the Hamihon Jacobi Bellman equation of the problem with an 
exponential discount factor exp J/„(x, u). However, multiplied by the positive constant 
j/„(0), this exponential is indeed Unit). 

The terminal condition T{y) = translates into T{x) = 0, and the boundary condition 
of the H.J.B. equation into Vx : T{x) = 0, U{x) = K{x). 

4.3 Mixed strategy vs mixed population 

We want to investigate here the possible meanings of the mixed, singular strategy. Recall 
that in our model, u = means reproducing, while u — 1 means feeding. What is the 
meaning of an intermediary strategy u € (0, 1) ? 

One possible interpretation is that each individual in the population spends some time 
eating and some time reproducing in a fast cycle, as compared to the horizon of the problem 
— here a year — and to the characteristic frequency of the dynamics, here a and c. This thus 
a monomorphic population agreeing on a mixed strategy for all individuals. 

In linear problems, it is possible to think of an intermediary u as representing a poly- 
morphic population, where a fraction u uses the strategy u = 1 while a fraction 1 — ft uses 
the strategy u ~ 0. However, our criterion here is nonlinear. Therefore further investigation 
is needed. 

Let the population be represented by a measured space 51, with a positive measure fi 
of total mass fi{fl) = 1. (It is thus a probability measure.) Each individual is represented 
as an u Cz fl. Assume also that each individual uj has to pick, at each instant of time, a 
control v{uj,t) G {0, 1}. We need to assume that uj n- v{uj,t) is /x-measurable for (allmost) 
all t E [0,r], and that, for (alhnost) all w G 17, 1 1-> v{u!,t) is piecewise constant, equal to 
or 1 on time intervals (hence Lebesgue measurable). 

Each individual has an energy 7r(a;,t) at time t. Its dynamics are 

'k{uj, t) = — a7r(tj, t) + hn{t)v{u!, t) . 
The rsource depletion rate is 

n = —cn / v{uj,t) dfj,{uj) . 
Jn 

So, if we set 

u{t) = / w(w,i)d//(w) = ij{{uj I v{uj,t) = 1}) 
Jn 

and 

p(t) = / 7r(w,t)d^(w), 
Jn 

we find equations ((T]) and ([2]), as it should. 
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The number of offspring produced by an individual is 



/(^) 



{l-v{uj,t))Tr{uj,t) dt, 



and for the whole population (using Fubini's theorem) 

i-T r 



F = / fiu;) dfiiuj) 
In 



(1 — i))7r(w, t) d^{uj) 



dt 



which would coincide with the formula ^ for J only if 7r(aj,t) and v{uj,t) were proba- 
bilistically independant, an impossible situation if each player uses a constant control on 
nonzero, measurable, time intervals. We would approach such an independance if we were 
to assume that the time intervals during which the individuals 'controls are constant are 
extremely short, and after each, the set of individuals using w ~ 1, say, is drawn at random, 
with probability independantly of 7r(aj,t). But this is reconstructing a monomorphic 

population of mixed players. 



A Switch function on feeding tributaries 
A.l All tributaries with u = 1 

Let ts, Xs, As, fis stand for the variables t, x, X and /i at the point where a trajectory with 
u — I meets either the switch line or a singular arc, either cooperative or uninvadable. To 
simplify the notations, let also 



On this trajectory, we have 



X[t) = [ Xs -\ 

\ c — a 
A(t) = A^a, 

fi{t) = /is7+— 

c — a 

We want to investigate the sign of 

a = bX — cfi — X , 

remembering that in all cases to be investigated, a{ts) = 0. We have already noticed that 

a = ca + abX + ax ~ b . 
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Hence 



-'=(^-*)(a6A(T) + ax{T) - 6) dr . 



(22) 



We wish to show that L{t) abX{T) + ax{T) — 6 < 0, thus proving that a is positive from 
t = to ts- Substituting the exphcit values of x, A and we get 



L{t) = a 



. X 7 ^ 
oAga + Xs — \ 



(-- 


-) 




a/ 



A. 2 Tributaries of the switch Une 

On the switch hne, Xs = bXs and ~ 0. Hence 



A,, 



7 



1 n c\ 

— a Va a/ 



We notice that i(ts) = a5(2As — 1/a)- But we know that on the switch hne As < l/2a. 
Hence L{ts) < 0. Let us investigate its time derivative: 

\L{t) = Xs (aa + (c - a)-) + - = aA^a + ((c - a)Xs + 1) - ■ 



ab 



a 



Hence > {—aXg + l)7/a > 0, and therefore, L{t) < for aUt t < tg. 

A. 3 Tributaries of the singular arcs 

We now have 

L{ts) = a{bXs + Xs) — b . 

We claim that Xs < b/a. As a matter of fact, on the singular arc of the cooperative solution, 
Xs < b/2a. On the singular arc of the uninvadable solution. 



Xs X — 

2a 



V4a 



It is easy to chack that \J 4a'^ + c? — 2a < c, so that indeed, Xs < b/a, and therefore, 
L{ts) < 0. Again, compute its time derivative: 



L{t) = a abXsU + ((c — a)xs + b) ■ 



> 



again thanks to & — axs > 0. So, again, we may conclude that for all t < ts, L{t) < 0. 

In all three cases, we conclude with the help of formula (22) that on these trajectories, 
for all t < ts, a{t) > 0. 
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